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Summary: Let a sequence of iid. random variables £1, . . . , £ n be given on a mea- 
surable space (A, X) with distribution \i together with a function f(xi, . . . , Xk) 
on the product space (A fc , X k ). Let [i n denote the empirical measure defined 
^ ■ by these random variables and consider the random integral 

M ■ n fc / 2 [' 

Jn,k(f) = J f( u U ■ ■ ■ ,Uk)(Vn(dux) - n(dui)) . . .(n n (duk) - n(du k )), 

where prime means that the diagonals are omitted from the domain of integra- 
tion. In this work a good bound is given on the probability P(\J n< k(f)\ > x) 
for all x > 0. It is similar to the bound in the analogous problem we obtain by 
considering the Gaussian (multiple) Wiener-Ito integral of the function /. The 
proof is based on an adaptation of some methods of the theory of Wiener-Ito 
integrals. In particular, a sort of diagram formula is proved for the random 
integrals J n ,k(f) together with some of its important properties, a result which 
may be interesting in itself. The relation of this paper to some results about 
>- 1 [/-statistics is also discussed. 
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The investigation of the questions discussed in this paper was motivated by some prob- 
lems of non-parametric maximum likelihood estimates. (See [5], [6].) The following 
problem is studied here: Let /ibea probability measure on a measurable space (A, X), 
and take a sequence £i,...,£ n of independent, identically distributed (A, X) valued 
random variables with distribution \i. Let us introduce the empirical measure /i n , 

Hn(A) = -#{j: & e A, 1 < j < n}, AeX 
n 



of this sample £i, . . . , £ n , and given a function f(xi, . . . , Xk) on the /c-fold product space 
(A fc , X k ) define the integral J n ,k(f) of the function / with respect to the /c-fold product 
of the normalized empirical measure \x n by the formula 

n k/2 r> 

Jn,k(f) = —Tf / f( u iT---> u k){li'n{du 1 )- n(du 1 ))...(/j, n (du k )- fi(du k )), 

where the prime in f' means that the diagonals Uj = ui, 1 < j < I < k, 
are omitted from the domain of integration. (1-1) 

Let us consider constants as functions of zero variable, and define J n ,o(c) = c for a 
constant c. Given a function f(xi, . . . , Xk) of k variables we want to give a good estimate 
on the probability P(\J n ,k(f)\ > %) for all x > 0. Our main result is the following 
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Theorem 1. Let f = f(x±, . . . , Xk) be a measurable function on the space (X k , X k (i k ) 
with some k > 1 such that 



SUp |/(xi,...,Xfc)| < 1, 

ij6X, l<J<fc 



and 



= £/ 2 (£ l5 . . . , &) = J f( Xl ,..., x k )n{ d Xl ) ...fji( dx k ) < a 2 (1.2) 

iwzt/i some constant a > 0. Let afeo assume that the measure \i on (X, X) is non- 
atomic. Then there exist some constants C = C k > and a = > 0, such that the 
random integral J n ,k{f) defined by formula (1.1) satisfies the inequality 

P(\Jn,k(f)\ >*)< Cmax (exp j-a (^) 2/ "} , exp {-^(na: 2 ) 1 /^ 1 )}^) (1.3) 

/or all x > 0. These constants C = C k > and a = a k > depend only on the 
parameter k. 

It may be useful to reformulate this result in the following equivalent form: 
Theorem 1'. Under the conditions of Theorem 1 

P(\Jn,k(f)\ >*)< Cexp | -a (^) 2A } for x < n k ' 2 a k+1 

with the number a appearing in (1.2) and some universal constants C = C k > 0, 
a = a k > 0, depending only on the multiplicity k of the integral J n ,k(f)- 

Theorem 1 clearly implies Theorem 1', since in the case x < n k / 2 a k+1 the first term 
is larger than the second one in the maximum at the right-hand side of formula (1.3). 
On the other hand Theorem V implies Theorem 1 also if x > n k / 2 a k+1 : since in this 

case Theorem V can be applied with a = (xn~ k / 2 ) 1 ^ h+1 ^ 1 > a. This yields that 
P(\J n ,k(f)\ > x ) ^C'expj-a (|) 2A } =Cexp{-a(?ix 2 ) 1 /(fc+i)} if x > n k / 2 a k+1 . 

As it will be seen later, the expression ||/|| 2 considered in formula (1.2) has the 
same order as the variance of the random integral J n ,k(f)- Beside this, if rj is a random 

variable with standard normal distribution, then P(\ar] k \ > x) = P I \rj\ > ( — ) < 

r /x\ 2 / k \ 

const, exp < — y— j >, and this inequality is essentially sharp. Thus the results of 

Theorem 1 or Theorem 1' state that the tail probability P(\J n ,k(f)\ > x ) °f the /c-fold 
random integral J n ,k(f) behaves similarly to P( \ const. arj k \ > x), where the random 
variable rj has standard normal distribution and a is the variance of J n ,k{f)i provided 
that the level x we consider is less than n k / 2 a k+1 . It can be shown that such a condition 
on the level x is really needed. 



Actually, I am interested in the following more general problem: Let a nice class 
of functions / G T be given on the space (X k ,X k ), and let us give a good bound on 

the probability P sup | J n k(f)\ > x ) . In a subsequent paper [7] I shall show that for 

nice classes T of functions, for instance if JF is a so-called Vapnik-Cervonenkis class of 
functions bounded by 1, a bound similar to that in Theorem 1 can be given for this 
maximum. But to show this first Theorem 1 of this paper has to be proved. 

I met such a problem when I tried to apply the method of proof for the existence 
of a Gaussian limit of the maximum likelihood estimates to certain non-parametric 
problems. (See [5] and [6].) The proof in the parametric case contains a simple but 
important linearization step. In this step the function appearing in the maximum 
likelihood equation is replaced by its Taylor expansion around the (unknown) parameter 
up to the first term, and it is shown that such a linearization causes only a negligibly 
small error. In my attempts to adapt this argument to the non-parametric case the 
result of Theorem 1 had to be applied. Actually this estimate was needed only in the 
case k = 2. With its help a bound can be obtained which corresponds to the estimate 
of the second coefficient in the Taylor expansion of the classical maximum likelihood 
equation. Also the problem discussed in paper [7] appears in certain non-parametric 
estimation problems. For instance in the estimation of a distribution function with the 
help of some observations such a problem appears in a natural way. In this case we 
have to bound the difference between the distribution function and its estimate for all 
numbers x, and this requires a different linearization for all numbers x. In this problem 
we want to estimate the supremum of the error in all points x, and this requires a bound 
on the supremum of a class of random integrals J n ,2(f)- 

Earlier I could only prove a weaker version of Theorem 1 in paper [4] and applied 
this result. In that paper I could not prove a better estimate for random integrals 
Jn,k(f) with a small variance. I do not know of other papers where the distribution 
of the random integral J n ,k(f) was investigated directly. On the other hand, some 
deep and interesting results were proved about the tail-behaviour of [/-statistics, more 
precisely about the tail-behaviour of so-called degenerated [/"-statistics with canonical 
kernel functions (see [1], [2]), and they are very close to our results. It may be worth- 
while to discuss the relation between them in more detail. To do this first I recall some 
notions about [/-statistics. 

Let us consider a function / = f(xi, . . . , Xk) defined on the k-th power (X h , X k ) 
of a space (X, X) together with a sequence of independent and identically distributed 
random variables £1,62, ••• which take their values on this space (X,X), and let \i 
denote their distribution. We define with their help the [/"-statistic I n ,k(f) 

In,k(f)=^ E (1-4) 

l<j' s <n, s=l,...,k 
js^j s ' if s^s' 

(The function / in this formula will also be called a kernel function in the sequel.) 



3 



A real valued function / = f(xi, . . . ,Xk) on the /c-th power (X k , X k ) of a space 
(X, X) is called a canonical kernel function (with respect to a probability measure on 
the space (X, X)) if 



/ 



f(xi, . . . , Xj-i, u, Xj+i, . . . , Xk)^( du) = for all 1 < j < k and x s &X,s^j. 



In an equivalent form this means that if £j, 1 < j < k, are independent random variables 
with distribution fj,, then Effa, . . . = xi, . • - = Xj-i, = x^+i, . . = 

Xk) = for all 1 < j < k and i s G I, s ^ j. 

The above definition slightly differs from that given in the literature, where gen- 
erally it is assumed that the kernel function in the [/-statistic is symmetric. But this 
difference is not essential. It is not difficult to see that I n ,k(f) = ^n,fc(Sym/), and our 
definition is equivalent to the usual one in the case of symmetric kernel functions. 

The definition of the integral J n ,k(f) contains a certain kind of regularization, since 
integration is taken with respect to the k-th power of the signed measure (i n — \i in it. 
On the other hand, the definition of [/-statistics contains no such regularization. This 
has the consequence that only [/-statistics with a canonical kernel function satisfy an 
estimate similar to Theorem 1. Theorem 1 has the following consequence. 

Theorem 2. If f = f(x\, . . . , Xk) is a canonical function on the space (X k , X k , n ), 

oo = sup \f(x u ... 7 x k )\ < 1 

XjEX, l<j<k 



and 



i = ^/ 2 (6, • • • , 6) = J fi^i-,----, Xk)fJ>( dxt) ...fj,( dx k ) < a 1 



then there exist some constants C = Ct > and a = au > such that the U -statistic 
In,k(f) defined in formula (1-4) satisfies the inequality 

P (n- k / 2 \I n)k (f)\ > x) < Cmax (^exp j-a (^^j ,exp ^-a{nx 2 ) l ^ k+1) }^ (1.5) 

for all x > 0, where the constants C = C k > and a = a k > depend only on the 
parameter k. 

I shall show that Theorem 2 is a simple consequence of Theorem 1. It is also 
possible to prove Theorem 2 directly and to deduce Theorem 1 from it. But such 
a deduction demands more work. A basic result of the theory of [/"-statistics, the 
Hoeffding decomposition may help, since it enables one to represent general [/-statistics 
(the random integral J n ,k(f) a ^ so can be considered as an [/-statistic) as a sum of U- 
statistics of different orders with canonical kernel functions. But to deduce Theorem 1 
from Theorem 2 some additional work is still needed. Here I do not deal with this 
problem, but the end of Section 2 in [7] indicates what kind of calculations are needed 
to work out the details. 
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Part c) of Theorem 2 in [1] gives an estimate equivalent to our Theorem 2. It states 
(with the notations of the present paper) that under the conditions of Theorem 2 

P > ,) < „ exp {- — - ^ 1/2fik+1) } (1-6) 

with some universal constants c\ and C2 depending only on the parameter k. In [1] 
this estimate is called a new Bernstein-type inequality. It is formally different from our 
Theorem 2, but actually they are equivalent if we do not specify the universal constants 
in these estimates. It is simple to see this if the two cases when the first and the second 
term is dominating in the denominator at the right-hand side of the expression in (1.6) 
are considered separately. 



2. The idea of the proof 

We shall prove Theorem 1 by first giving a good estimate on certain even moments of the 
random variable J n ,k(f) and then by applying the Markov inequality P(\J n ,k(f)\ > x) < 
EJ ( f) 2M 

w ' 2M . In the proof we need a good bound on the moments EJ n ^{f) 2M - We do 

not have to estimate all even moments, but a good estimate is needed for a sufficiently 
large class of moments. It is enough to bound the moments of the form M = 2 m , 
m = 0,1,2,..., and this is done in Proposition 1. The estimate of Proposition 1 is 
contained in formula (3.1). This formula presents an upper bound for EJ ni k(f) 2M in 
the form of the maximum of two terms, where the first term has the same order as the 
2M-th moment of a random variable const. arj k with a random variable rj with standard 
normal distribution, and the second correction term is needed to make relation (3.1) 
valid also for small values of a. 

The main problem of this paper is how to prove Proposition 1. To overcome 
this difficulty let us consider an analogous problem when the moments of a multiple 
stochastic integral with respect to a Gaussian process (with independent increments) 
are estimated and try to adapt the method applied there. An important result about 
stochastic integrals, the diagram formula (see e.g. [3]), enables us to prove a good 
estimate for the moments of multiple stochastic integrals with respect to a Gaussian 
process. It expresses the product of two multiple stochastic integrals as the sum of 
certain multiple stochastic integrals with different multiplicities. Here also a constant 
term may appear which can be considered as a stochastic integral of zero order. The 
name diagram formula refers to the fact that the kernel functions of the stochastic 
integrals appearing in this identity are defined with the help of certain diagrams. The 
diagram formula enables us to express the 2M-th power of a multiple Gaussian stochastic 
integral as the sum of certain stochastic integrals and to bound its expected value. I 
want to show that an appropriate adaptation of this method can yield the proof of 
Proposition 1. 

To carry out such a program we need a result analogous to the diagram formula 
where the product of two multiple stochastic integrals J n ^ x (/) and J n ,k 2 (<?) with re- 
spect to a normalized empirical measure is expressed as the sum of multiple stochastic 
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integrals. Such a result is proved in Lemma 2, and this also will be called the diagram 
formula. The main difference between Lemma 2 and the corresponding result in the 
Gaussian case is that now some new terms appear. The kernel functions of the multiple 
stochastic integrals in this new version of the diagram formula will be defined with the 
help of an object I shall call coloured diagram. 

This diagram formula enables us to bound the 4M = 2 m+2 -th moment of multiple 
stochastic integrals if we have a good bound on their 2M = 2 m+1 -th moment. This 
suggests to try to give a bound on the moments EJ n ^(f) 2 by means of an inductive 
procedure with respect the parameter m. To do this first we have to estimate the first 
and second moment of a multiple stochastic integral J n ,k(f)- Such an estimate is given 
in Lemmas 1 and 3. They are slightly different from the formulas that appear in the 
study of Gaussian multiple stochastic integrals. The expected value of the stochastic 
integral J n ,k(f) ma Y be non-zero, and instead of an exact identity I can only give an 
upper bound for its second moment. The cause of this difference is that, unlike in the 
Gaussian case, the (normalized) empirical measures of disjoint sets are not independent. 
But the dependence between them is very weak, therefore the expected value EJ n ^(f) 
is so small that its non-zero value causes no problem for us. The bound we give for this 
expected value does not depend on the sample size n, at least if n > |. On the other 
hand, the estimation we gave in Lemma 1 depends on the multiplicity of the integral 
Jn,k(f)- We need such a bound, since we estimate the moments of the random integrals 
Jn,k(f) simultaneously for all k = 1,2, 

The diagram formula reduces the estimate of the 4M = 2 m+2 -th moment of a 
multiple random integral to the estimate of the 2M = 2 m+1 -th moment of a sum of 
multiple random integrals. If we do not try to get the best possible constant a in 
Theorem 1 then it is enough to give a sufficiently good estimate on the 2M-th moment 
of each term in this sum. To get such an estimate the following analytical problem has 
to be solved: Given two functions f(x\, . . . , Xfc-J and g(x\, . . . ,Xk 2 ) let us express the 
product of the random integrals J n ,k 1 (f)Jn,k 2 (9) by means of the diagram formula and 
give a good estimate on the L 2 -norm of the kernel functions appearing at the right-hand 
side of this expression. The solution of this problem is a most essential part of our proof. 

Such an estimate is given in formulas (3.14) and (3.15) which can be proved in a 
natural way by means of the Schwarz inequality. Formula (3.14) yields an estimate for 
the L2 norm of general kernel functions appearing in the diagram formula, and formula 
(3.15) provides a better bound in the case of those kernel functions which also appear 
in the Gaussian case. But these estimates alone are not sufficient for our purposes. 
It turned out that in certain cases the Li norm of the kernel functions defined in the 
diagram formula can be better bounded. Moreover, to get a sufficiently good estimate 
for the moments bounded in Proposition 1 we have to exploit this improvement. 

This lead us to the introduction of the notion of (r, a 2 ) dominated functions and 
the formulation of Lemma 4 which contains the most important properties of functions 
with such a property. After Lemma 4 I also formulated Proposition 2. This is a 
sharpened version of Proposition 1 which yields a better estimate for the expected 
value EJ nj k(f) 2M if the variance a 2 of J n ,k(f) is small and / is an (r, a 2 ) dominated 
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function with some large r. Proposition 2 was introduced not because the estimate 
of Proposition 1 was not good enough for us, but because the inductive procedure we 
apply in the proof works only for Proposition 2. To carry out this procedure we must 
have a sufficiently good estimate for each term appearing in the intermediate steps, and 
only Proposition 2 provides a good estimate for them. 

To prove Proposition 2 still we need a technical result formulated in Lemma 5. 
By working out the arguments sketched above the expected value EJ Ujk (f) 2M can be 
estimated by a sum with terms possessing a good upper bound. We have to show that 
these bounds also yield a sufficiently good estimate for the sum itself. This is not a 
trivial problem, and Lemma 5 helps us to overcome this difficulty. 

This paper consists of five sections. In Sections 1 and 2 the main results of the 
paper were presented and the method of the proof was explained. The lemmas and 
propositions needed in the proof are formulated Section 3, and also Theorems 1 and 2 
are proved there. Lemmas 1 — 4 are proved in Section 4. They describe some important 
properties of the random integrals J n ,k(f) defined in formula (1.1). I also put Lemma 
4 to Section 4, although it contains a result about a deterministic transform of some 
functions, so formally this result is not connected to the properties of the random 
integrals J n ,k- Nevertheless, it can be put among the important results about such 
random integrals, because it plays an essential role in the estimation of the high moments 
of the terms appearing in the diagram formula. Finally, Section 5 contains the proof of 
Proposition 2 together with the rather technical Lemma 5 needed in its proof. Theorem 1 
was proved as a consequence of this Proposition. 

3. The proof of Theorems 1 and 2 with the help of some lemmas 

Theorem 1 will be proved with the help of the following result. 

Proposition 1. If f = f(xi, . . . , x k ) is a function of k variables, 

ll/lloo = sup |/(xi,...,x fc )| < 1, 

Xj€X, l<j<k 

and 

ll/lll = J f 2 (%i, ■ ■ ■ ,x k )n(dxi) . . . fi(dxk) < cr 2 , 

the measure [i is non-atomic, then there exist some constants Ck such that for all num- 
bers of the form M = 2 m , where m is a non-negative integer, and n = 1, 2, . . . 

EJ n , k (ff M < {Ck* 2 M k ) M ■ max ^1, (JL^j ^ . ( 3 .1) 

The constant Ck depends only on the number of variables k in the function f . 

Proposition 1 states that typically the 2M-th moment of the random variable 
Jn,k(f) behaves like the 2M-th moment of const. arj k , where r\ is a random variable 
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with standard normal distribution. But such an estimate holds only if no 2 is not too 
small. Formula 3.1 also contains a constant Ck not given explicitly. The parameter a 
in the exponent of the estimate in Theorem 1 depends on its value. 

To prove Proposition 1 we give an upper bound for the first and second moment of 
a random integral J n ,k(f) if / is a bounded function with k variables and prove some 
kind of diagram formula which expresses the random integral J n ,k(f) 2 or more generally 
the product of random integrals J n ,k x (f)Jn,k 2 (9) with bounded functions / and g of k± 
and k 2 arguments as the sum of appropriate random integrals J n of some functions 
which have < k < k± + k 2 arguments. 

To express a product J n ,fci (f)Jn,k 2 (fiO °f random integrals with functions / and g 
of k\ and k 2 arguments respectively in the desired form we introduce some diagrams 
and coloured diagrams and define some functions with their help. 

Given two positive integers k\ > and k 2 > let us consider a set 

Af={(j 1 ,f 1 ),...,(ji,fi)} 

of pairs of integers such that 1 < j s < ki, k± + 1 < j' s < k\ + k 2 , 1 < s < I, and 
3s 7^ Js', j' s ^ j' s ' if s ^ s'. Let us assume that 1 < jt < j 2 < • • • < ji < h to 
introduce an explicit ordering among the elements of the set N '. We define the diagram 
B{JV) = B{N, k\,k2) in the following way: The diagram B{J\f) has two rows, the first 
row consists of the vertices 1, . . . , k\ and the second row of the vertices fei + 1, . . . , ki+k 2 . 
The diagram B(M) contains the edges connecting the vertex j s from the first row with 
the vertex j' s in the second row, (j s ,j' s ) E J\f, 1 < s < I. Given a diagram B{J\f) and 
a subset Mi C JV we define the coloured diagram B(J\f,J\fi) = B(N,N\, fei, k 2 ) as the 
diagram B(N) whose edges are coloured in the following way: An edge (j u ,j' u ) has 
colour 1 if (j u ,j' u ) G M, and colour -1 if (j u ,j' u ) G jV\M- 

Let us also define the following operators on the space of functions. Given an inte- 
grable function h(x±, . . . , x s ) on the product space (X s , X s ', fx s ) with some s = 1, 2, . . . , 
define the operators Pj = P JjS , 1 < j < s, as 

Pjh(x u . . . ,Xj- U x j+u . . . ,x s ) = J h(x!, . . . ,x j - 1 ,u,x j+1 , . . . ,x s )/j,(du). (3.2) 

Let us fix a pair of integers (j,j'), 1 < j < k±, k\ + 1 < f < k\ + k 2 and a function 
h( x s(i)> ■ ■ ■ i x s(r)) ° n an appropriate product of the space (X, X, whose variables are 
indexed by some numbers 1 < s(l) < s(2) < ••• < s(r) < k± + k 2 , and {j, j'} C 
{s(l), . . . , s(r)}. In this case we define the function Rjj'h as 

s(l) ? • • • i X s{p — l)i %s(p)i %s(p+l)i • • • ? X s{q — 1)? X s{q)i %s(q + l)i • • • ? %s(r)) 
= /i(x s (i), . . . , X s (p_x), X s ( p y X s (p_|_i), . . . , X s (g_x), X s (p), X s (g_|_i), . . . , iC s (r)) (3-3) 

if s(p) =j, and s(q) = j' 

In words: The application of the operator Rjj' means that the argument xy is replaced 
by the argument Xj. 
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Given two functions f(xi, . . . , x kl ) and g(xi, . . . ,x k2 ) on the spaces (X fcl , X kl ) and 
[X k2 ,X k2 ) respectively define the function 

f og( Xl ,...,x kl+k2 ) = f(x 1 , . . . ,x kl )g(x kl+1 , . . . ,x kl+k2 ) (3.4) 

and given a diagram B(N) = B(Af, fei, k 2 ) with edges A/" = • • • , (jz, j'/)} put 

z 

/ '° 9b(AT)( x s(1), ■ ■ ■ ,X a (fc 1+ fc 2 _i)) = JJ J R itJ //o 5 r(xi,...,X fcl+ fc 2 ) (3.5) 

t=l 

with s(-u) = m if 1 < u < ki and to define s(u) in the case k\ + 1 < u < ki + k 2 — I 
let us list the set {k x + 1, . . . , h + k 2 } \ .. ., j[] = {v(l), . . . , v(k 2 - /)}, h + 1 < 
v (1) < • • ■ < f (/«2 — < ki + A;2 in increasing order and put s(u) = v(u — k\). In words, 
the following construction was made. Take the product of the functions / and g, and 
replace the argument xy a by Xj a for all edges (j s ,j s ') of the diagram B(J\f), and then 
the indices of the variables are not changed. The numbers s(u) were introduced to list 
the indices of the variables in an increasing order. 

Given a coloured diagram B{N,N\) = B(Af,Afi, k\, k 2 ) with 

M= and M = {(ju(i),j' u (i)),---,(ju(p),j'u(p))}, 

where {^(1), . . . , u(p)} C {1, . . . , /} define 

p i 

f ° 9B(ATMi)( x <i)>- ■ ■ i x s(k 1+ k 2 -i- P )) = Yl p ju (v) Yl R jt,fJ °9(xu...,x kl +k 2 ) (3.6) 

v=l t=l 

with the help of the operators Pj and Rjj> defined in (3.2) and (3.3), where the operators 
at the right-hand side are applied from right to left order and then the indices u(-) at 
the left-hand side are defined in the following way: Let us list the sets {w(l), . . . , w(k\ — 
P)} = {1, - - - , fei} \ {ju(i),---,ju( P )}, 1 < w(l) < ■■■ < wih-p), and {v(l), . . . , v(k 2 - 
I)} = {k 1 + l,...,k 1 + k 2 }\ O'i, • • • , fi), h + l< v(l) <■■■ < v(k 2 -l)<h + fc 2 , in 
increasing order. Then we define s(u) = w(u) if 1 < u < k\ —p, and s(u) = v(u — k\ +p) 
if ki — p + 1 < u < ki + k 2 — I — p. In words, take the function / o g B ^ and apply the 
projection operator P u for those indices u which are end-points of edges with colour 1 
in the first row of the coloured diagram B(Af,J\fi) i.e. which are end-points of edges in 
Afi in the first row. After this procedure these variables x u disappear. The remaining 
variables are preserved together with their indices. The numbers s{u) were introduced 
again to list these variables in increasing order. 

To formulate the diagram formula we define the function f ° gB(M,Ni) w ith the help 
of the function / o gB{M,Mi) * n f° uowm g wa Y- 

/ ° 9B(Af,Ari)( Xl > • • • ' x ki+k 2 -l-p) = f ° fi , B(AT,M)( :r s(l)' • • • ) x s(k!+k 2 -l-p))- (3-7) 

Here we reindexed the variables of / o g and listed them in an increasing order. 



9 



Now I formulate some lemmas needed in the proof of Proposition 1. 

Lemma 1. Let f(xi, . . . , x k ) be an integrable function on a space (X k , X k , /i k ) where 
(i k is the k-th power of the probability measure \i appearing in formula (3.1). Let the 
measure fx be non-atomic. Then 

EJ n}k (f) = B n)k J f(ui,...,Uk)/Ji(dui)...iJL(duk) (3.8) 



with 

k 



.* = E(-!) fc - s C) * ! E ^ - 1) • • • (*■» - ■ ■ ■ > 

5=1 (ri,-,r s ) 

Vj is integer rj>l, l<i<s, 
SiH \-r s =k 

(3.9) 

where B(r±, . . . ,r s ) equals the number of partitions of the set {1, . . . , k} to disjoint sets 
with cardinalities r±, . . . ,r s . The above constants -B n ,fc satisfy the estimate 

\ B nA < ^ (3-10) 
with some universal constant C > for all n > | . 

Lemma 2. (Diagram formula) Let f = f(x\, . . . ,x kl ) and g(x\, . . . , X}~ 2 ) be two 
square integrable functions on the product spaces (X kl , X kl , fi kl ) and (X k2 , X k2 , (i k2 ) 
respectively, and assume that the measure [i is non-atomic. Then the identity 

min(fei,fc 2 ) I _|_ fc 2 _ / 

Jn,kAf) J n,kM = Yl Yl ( fcl _ ^)1( fc2 ^ /)!(/ _ |?)!|? , n " (Z " P)/2j n,fc 1 +fc 2 -;- P (/0( 7 ; )P ). 

(3.11) 

/io/ds wift £/ie functions f o gi p = f o gi^^M of k\ + ki — I — p variables, < p < I < 
min(/ci, k 2 ), defined by the formula 

f f , (h - l)l(k 2 - 1)1(1 - p)!p! 

1- 2 ' (3.12) 

2 , f °gB{M,N 1 ){ x li---i x k 1 +k 2 -l-p) 

B{NMi)£B(l,p) 

where the functions f ° gB{N,Ni) are defined by formulas (3.2), (3.3), (3.6) and (3.7), 
and the set 13(1, p) = B(l,p,k\,k 2 ) consists of those coloured diagrams B(M,Mi) = 
B(H,Hi, ki, k 2 ) with k\ vertices in the first and k 2 vertices in the second row for which 
\M\ = I, and = p. (We apply here and in the sequel the convention that \ A\ denotes 
the cardinality of a set A.) 

10 



_ l)Uk 2 - 1)1(1 - p)lpl 

Let us remark that the norming constant r~rr~t m (3-12) is the 

fcl'fo! 

reciproque of the cardinality of the set 13(1, p, k\, fo). 

Lemma 3. Let f(x\, . . . , x k ) be a square integrable function on the space (X k ,X k ), 
and let fx be a non-atomic measure on (X, X) . Then for all n > k 

EJ n , k (f) 2 <^\\f\\i (3-13) 
with some universal constant C > 0. 

The diagram formula formulated in Lemma 2 enables us to estimate EJ njk (f) 2 + 
if we have an estimate for the expectation EJ nk -(g) 2 for all bounded functions g (where 
also the number of arguments in the function g may be arbitrary). This may enable 
us to prove Proposition 1 by induction. But to prove a sufficiently good estimate, 
first we have to prove a good bound on the L2 norm of functions of the form / o 
fB(N,Ni)( Xl -> • • • ' X 2k- P ) with some coloured diagram B(H,Hi). We will show that 

\\f 9b{MMi)\\1 ^ II/II2IMI2 for any diagram B(M,Ni) if ||/||oo < 1 and H^H^ < 1. 

(3.14) 

Moreover, in the special case when \M\ — |A/*i| this estimate can be improved. Namely, 
we shall show that 

H/o^(^)lli<ll/ll2lMl2 if|^| = |M|- (3.15) 

Let us call a diagram B(M,N\) Gaussian if M = M\, and non-Gaussian in the other 
case. The introduction of such a terminology is natural, since in the diagram formula for 
multiple stochastic integrals with respect to Gaussian processes those diagrams appear 
(with the same integrals attached to them) which we called Gaussian. The most difficult 
part of our investigation is to check the contribution of the integrals corresponding to 
non-Gaussian diagrams. 

To prove formula (3.14) let us take two functions f(x\, . . . , x^) and g(x\, . . . , Xk 2 ) 
whose supremum norm is bounded by 1 and consider the function / o g B ^^ with some 
diagram Af defined by formulas (3.3), (3.4) and (3.5). The Schwarz inequality yields 
that 



f °9 B {M) 2 < (y / 4 (^i,---,^fc)^(^i)---^(^ fcl )^ 

j g 4 (x x , . . . ,x k2 )n(dxi) . . . n(dx k2 ) 
j f 2 (x 1 ,...,x k )/j(dx 1 )...ij(dx kl ) 
{^j g 2 (xi, ■ ■ ■ ,x k2 )ix(dxi) . . . fi(dx k2 )^ = II/II2IMI2 



1/2 

1/2 

1/2 

1/2 
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if H/lloo < 1 and U^H^ < 1. 

The operators Pj are contractions in L 2 norm. Hence the above estimate together 
with formulas (3.5), (3.6) and (3.7) imply that 



\\f 9B(^Mi)\\ 2 = / #s(Af,AM - f°9B{X) 



< 



2\\9\\2, 



and this is relation (3.14) 

2 

To prove formula (3.15) let us first give a pointwise estimate on /o 9b(N M) with 
the help of the Schwarz inequality. We get by using formulas (3.2), (3.3), (3.4) and (3.6) 
that 

, 1 

/°#B(Af,Ao( x s(i)'---' x s(fci+fc2-*-p)) < / f 2 (xi,...,x kl ) Y[^(dx js ) 

J s=l 

/ f 2 (^fci+i,---,^i+fc 2 ) n^(^)' 

J s=l 



(3.16) 



where A/" = {(ji, • • • , (ji, Beside this, the arguments of the function at the 

left-hand side of (3.16) are the elements of the set {s(l), • • • , s(k± + k 2 — I — p)} = 
{1, . . . , ki + k 2 } \ {ji, . • • , ji, j[, ■ ■ ■ , ][}■ The important point is that the estimate at the 
right-hand side of (3.16) is the product of two functions with different arguments. We 
get by integrating both sides of inequality (3.16) that 



/ 9b{N,N) 



< 



2 \\n\\ 2 
2115112- 



This inequality together with the identity ||/ o g B ^ iM) ||| = 11/ 9b{M,M) II 2 imply for- 
mula (3.15). 

Lemma 3 gives a bound on EJ nj k(f) 2 for a function / with the help of ||/|||) and 
the application of Lemma 2 with the choice / = g enables us to bound EJ Uj k(f) 4M if we 
have a bound for the expressions of the form EJ n ^(g) 2M . In such a way we can prove 
good bounds on the moments EJ nj k(f) 2 by induction. To carry out such a program 
we need a good bound on the L 2 norm of the terms at the right-hand side (3.12) in the 
special case / = g. Formulas (3.14) and (3.15) supply such estimates, but they are not 
always good enough for our purposes. 

These estimates are sufficient to bound EJ nt k(f) 2M for M = 1, 2, but do not suffice 
already in the case M = 4. Indeed, to give a good bound on EJ nj k(f) 8 with the help 
of Lemma 2 we also have to give a good bound on -E , J n ,2fc(fi') 4 with g(x\, . . . , x 2 k) = 
f(xt, . . . ,Xk)f(xk+i, ■ ■ ■ ,X2k)- (This term is / o f B (N,Ni) with A/" = M = 0-) To 
estimate this expression (again with the help of Lemma 2) we also have to bound 
^^n,4fc-i(fi'i) 2 with the function gi{x u . . . , x 4fc _i) = g(x 1 , . . . , x 2k )g{x u x 2k+2 , . . . , x 4k ) 
(here we consider the diagram B{N,Ni) with M = {(1, 2k + 1)} and Mi = 0), and to 
estimate -E , J n ,4fc-i(fi , i) 2 we have to bound L 2 norm of the function gi. 
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Formulas (3.14) and (3.15) imply that \\g\\ 2 2 < a 4 and < a 4 if ||/||| < a 2 . But 

a more careful analysis shows that the function gi also satisfies the sharper estimate 
||<7i 111 — °" 6 - Indeed, the function g is the product of two functions with different 
arguments, and the L2 norm of both terms is bounded by a. Further considerations 
show that the function gi can be written as the product of three terms whose L2 norm 
are bounded by a, and this implies the above statement. 

If || /|| I — 0-2 with a small number a, then to get a sufficiently good estimate for 
EJ n ,k(f) 2M for M > 4 with the help of Lemma 2 a sufficiently good bound has to be 
proved about L2 norm of certain functions. The estimates (3.14) and (3.15) are not 
always sufficient for this goal. In the previous example we could get some improvement 
by exploiting the special structure of certain functions. It worked, because the function 
whose L 2 norm we had to bound could be represented as the product of functions with 
different arguments and small L2 norms. An appropriate refinement of this argument 
would also supply a useful estimate for the L2 norm of such functions which can be 
bounded by the product of functions with different arguments and small L 2 norm. To get 
sufficiently good estimates which lead to the proof of Propitiation 1 we have to exploit 
the improvements following from the above sketched observations. Such considerations 
lead to the introduction of the following definition. 

The definition of (r, a 2 ) dominated functions. Let us consider a function f = 
f(x Ul , . . . , x Uk ), x Uj e X, 1 < j < k, of k variables. We say that this function is (r, a 2 ) 
dominated with 1 < r < 00, < a < 1 if the set {ui, . . . , ui-} can be partitioned to 

r 

r disjoint sets B\ = . . . , j Sl ^}, 1 < I < r, \J Bi = {ui, . . . , u^}, and for all 

1=1 

1 < I < r such functions hi = hi(x .(0 , . . . , x .(0) can be defined whose arguments are 

3\ 3si 

indexed by the elements of the set Bi, and which satisfy the inequalities 

r 

\f(x Ul ,...,x Uk )\ < Y[hi(x.(i),...,x.(i)) for all (x Ul , . . . , x u J 
1=1 1 s ' 

and \\hi\loo < 1, \\hiW 2 , < o 2 , 1 < I < r. 

In particular, we allow that some of the sets Bi be empty in the above partition. If 
the set Bi is empty, then the function hi has to be constant, and < hi < a. 

Let us remark that a function / is (1, a 2 ) dominated if and only if ||/||oo < 1, and 
ll/l| 2 <a. 

The following Lemma 4 which uses the above definition plays a most important 
role in the proof of Proposition 1. We formulate it in a form more general than we need. 
We shall consider two functions / and g, although we shall apply this result only in the 
special case when / = g. 

Lemma 4. Let f be a function on the product space (X kl , X kl , /i kl ) and g a function 
on the product space (X k2 , X k2 , fi k2 ) with some integers k\ >\,ki> 1. Let us take a 
coloured diagram B(J\f,j\f\) = B(j\f,j\fi, k±, k^) with \M\ = I and \j\fi\ = p and consider 
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the function f ° gB(N,Ni) defined by formulas (3.2), (3.3), (3.6) and (3.7). Let f be 
an (ri, a 2 ) and g an (r 2 , a 2 ) dominated function. If the relation r\ + r 2 > I — p + 1 
holds, then the function f o gB{M,Mi_) ^ s ( r i + r 2 — (I — p), o" 2 ) dominated. Beside this, 
the function f o gB{M,Mi) ^ s (1> a^ Vl+r2 '' 2 ) dominated for all < p < I < min(/ci, fc 2 ). 

It is clear that if / is an (ri, a) and g is an r 2 , cr) dominated function, then fog 
is an ri + 7*2,0" dominated function. Lemma 4 essentially states that if we calculate 
the function / o gB(M,Ni) with the help of the function fog, then the edges of M\ 
(with colour 1) do not diminish and the edges in M\ Afi diminish only with 1 the first 
coordinate in the (ri + r 2 , cr 2 ) dominance property. 

With the help of Lemmas 1 — 4 the following Proposition 2 can be proved which 
yields a more detailed description on the moments of the random integrals J n ,k(f) than 
Proposition 1. 

Proposition 2. Let f = f(x±, . . . , Xk) be a measurable (r, cr 2 / r ) dominated function of 
k variables, k > 1, with some r > 1. Put M = 2 m . Let us assume that the measure fx 
appearing in formula (3.1) is non-atomic. Then for all m = 0, 1, . . . such that kM < n 

with some constants C(k,m) > 1 which do not depend on the parameters n and a, and 
they satisfy the inequality sup C(/c, m) < Ck < oo for all k = 1,2, ... . 

0<m<oo 

Remark: The condition kM < n in Proposition 2 is of technical character which prob- 
ably can be omitted. We imposed this condition to avoid the investigation of such 
degenerate random integrals J n ,k( f) where the number of variables of the function / is 
larger than the sample size n. To avoid such a situation during the inductive procedure 
leading to the proof of Proposition 2 we had to impose the condition kM < n. This 
restriction does not cause a serious problem in the proof of Theorem 1. 

Proposition 1 follows from Proposition 2 if we apply it for (r, cr 2 / r ) dominated func- 
tions / = f(xi, . . . , Xk) with r = 1 and choose C = Ck = fc _ 1 sup C(/c, m). Actu- 

0<m<oo 

ally Proposition 2 implies Proposition 1 only under the additional condition kM < n. 
But Proposition 1 can be proved in the case n < kM with the help of the following very 
rude estimate: Since the measure /U n — fi is the difference of two probability measures and 
H/lloo < 1, hence ||J n ,fc/||oo < 2 k n k / 2 and EJ n , k {f) 2M < 2 2kM n kM < 2 2kM (kM) kM < 

n -M 2 2kM\ kM ^{k+l)M = C M n -M M (k+l)M ^ Cfc = 4 fc fc fc+l if n < fcM _ Thig ^ 

mate means that the estimate (3.1) remains valid with a possibly new constant Ck if 
we also allow the case n < kM. (Here we use the second term in the maximum at the 
right-hand side of (3.1).) Hence it is enough to prove Proposition 2. 

Proposition 2 will be proved by induction with respect to m. For m = Lemma 3 
implies the result we need. If we know the bound (3.17) for some m with a constant 
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C(k, to), then this bound together with Lemma 4 enable us to give a good bound on the 
2 m -th moment of the expressions J n ,2k-(l-p) (f°fl,p) with the functions fofi tP appearing 
in formula (3.12). Then we can prove formula (3.17) for m + 1 with an appropriate 
constant C(k,m + 1) by means of the diagram formula presented in Lemma 2. The 
calculations made during this proof show that constant C(/c, m + 1) appearing in this 
estimate depends only on the parameters k and m, i.e. it does not depend on the 
sample size n or the variance a 2 . Moreover, an important but rather technical result 
formulated below in Lemma 5 will enable us to show that the norming constants C(k, to) 
can be chosen relatively small. By exploiting the existence of some constants C(k, m) 
with the properties formulated in Lemma 5 we can show that in Proposition 2 the 
constants C(/c,m) can be chosen in such a way that they also satisfy the inequality 
sup C(/c, to) < oo. 

0<m<oo 

Lemma 5. There exists a set of positive real numbers C{k,m), (7(/c,0) = 1, m = 
0, 1, . . . , k = 0, 1, . . . , which satisfy the inequality 

C(k, to + If > 2-( 4 -) (2fe) 2fc -' + n2fe-j-p) 3 '-^ 2 ^ (2fc _ l _ ^ m) ^ 
for all k = 0, 1, 2, ... , to = 0, 1, 2, . . . and < p < I < k, 

and 

sup C(k,m) < D(k) < oo for all k = 0, 1, 2, . . . (3.19) 

m>0 

with some constants D(k) > 0, k = 0, 1,2, ... . The numbers C(k,m) can be given in 
the form C(k,m) = C(m) k , where C(m), m = 0,1,2,..., is a monotone increasing 
sequence such that C(0) = 1, and lim C(m) < oo. 

We finish Section 2 by proving Theorems 1 and 2 with the help of Proposition 1. 
The proof of Theorem 1. By Proposition 1 and the Markov inequality 

/M ./„,,(/) > x) < ( 9J^1\ M . max ( i. (M) M I (3.20) 



. I ' l 9 



for all positive integers M of the form M = 2 m . Let us assume for a while that 
the number x satisfies the inequalities — > 2^ k+1 ^ 2 Ct^ 2 and nx 2 > 2 k+3 Ck- Then the 



1 C k a 2 M k 1 



number M = 2 m can be chosen in such a way that the inequality ^ k+1 < ^ < ^ ' 

1 fx\ 2 / k 1 fx\ 2 / k 

i-e. ; rTTT^TTTTi I — ) < M < - — - ,, ( — ) holds. Let us choose such a number 
(2 k+1 C k ) 1 / k \aJ ~ (2C k ) 1 / k \aJ 

M if ; 7TT77I - < na 2 , i.e. x < V2C k 'n k / 2 a k+1 . In this case M < no 2 , the 
(2C k y/ k \aJ 

first term is dominating in the maximum at the right-hand side of formula (3.20), and 
P (|«^n,fc(/)| > x) < 2~ M < exp < —a y—J > with some appropriate a > 0. 
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In the case x > y/2C k ~n k / 2 a k+1 let us choose M = 2 m in such a way that .} . < 
C/ C M A:+1 1 

~ — < -. (There is a number M = 2 m satisfying this inequality, because by our 

nx £ 4 

C k M k+1 1 

temporary conditions nx 2 > 2 k+3 Ck, i.e. - — < —r— - for M = 1.) With the 

nx 2 2 k+6 

above choice of M the inequality — ~ > 2~( fc+2 )/( fc+1 ) holds, since , < 5 — < 

na z 2 fc+d nx z 

1 / M \ k+1 

— i — 2 J f° r x — v / 2C , / i; n A: / 2 a" A:+1 . Hence, by formula (3.20) and the inequality 

1 C k M k+1 1 
2( fc + 3 ) ~ nx 2 ~ 4 

P(|J„,*(/)|>x)< 



/C fc a 2 M fc \ M / 2 ( fc + 2 )/( fc +DM\ M _ /2(fc+2)/(fc+i) CfcM fc+i\ M 
\ x 2 / \ no -2 / \ nx 2 / 

< exp 



2 (fc+2)/(fc+l^ M 



< 

with some appropriate a > 0. 

The above calculations show that formula (1.3) holds if — > 2 ( - k+1 ^ 2 C^ 2 and 

nx 2 > 2 fc+3 Cfc. By choosing the constant C > sufficiently large in formula (1.3) 
we can achieve that the right-hand side of (1.3) is greater than 1 if one of the above 
inequalities does not hold. With such a choice of the constant C we get that formula 
(1.3) holds for all x > 0. 

Now we turn to the proof of Theorem 2. 

The proof of Theorem 2. Let us first consider the case when the measure \i is non- 
atomic. Let us take a canonical function f(x\, . . . ,Xk) of k variables. In this case we 
can write 



n k/2 rl 

n~ k/2 I nj k(f) = — j-j- / f(ui,...,Uk)fln(dui)...Hn(dUk) 



n k/2 



/ /(ui, . . .,itfc)(// n (dtti) - n(dui)) . . .(jj,n(duk) - v(du k )) = J n ,k(f)- 
K - J (3.21) 

(Let us remark that we have exploited at this point that the measure fx is non-atomic. 
The canonical property of the function / implies that f f(u\, . . . , dv,j) = for 
any 1 < j < k and u s £ I, j ^ s. But we need the fact that this relation remains valid 
if we omit the points u = Uj, j 7^ s, from the domain of integration.) Theorem 1 and 
formula (3.21) imply Theorem 2 in the case when fi is non-atomic. 

The general case can be reduced to the case of non-atomic measure fx with the help 
of the following construction. Given a space (X, X) with a measure fi on it together 
with a sequence of independent random variables £1, . . . , £ n an d a canonical function 
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/(xi, . . . , Xk) on the product space (X k , X k ) consider the space (Y, £>, A) where Y is the 
unit interval, B is the Borel a-algebra and A is the Lebesgue measure on it. Then define 
the product space (X, X, Jjl) as X = X x Y, X = X x A and ft = fx x X. Define the 
function / on the fc-th power of this new space (X h , X ) as f((x±, y±), . . . , (xk, yjt)) = 
f(x±, . . . ,Xk), i-e. the function / does not depend on the new coordinates. Finally, 
let us consider a sequence of independent random variables rji , . . . , r\ n with uniform 
distribution on the unit interval [0, 1] which are independent also of the random variables 
£1, . . . , £ n . Define the random variables (£1, . . . , £ n ) = ((£1, 771), . . . ,_(£ n , 7/ n )), and let /z n 
denote their empirical distribution function. Then I n ,k(f) = In,k(f), and the functions 
/ and / are simultaneously canonical. This means that the result of Theorem 2 can be 
applied for the function /, hence it also holds for the function /. 

4. On some basic properties of the random integrals J n ^ 

First we prove Lemma 1 which expresses the expected value of the random integrals 
Jn,k(f)- 

The proof of Lemma 1. Let us prove formula (3.8) first in the special case when 
f(x±, . . . , Xk) = I a x (xi) ■ ■ ■ lA k (xk), where A±, . . . , Af~ are disjoint measurable subsets of 
X and 1(A) denotes the indicator function of the set A. Let us introduce the (random) 
signed measures vi, 1 < / < n, on (X, X) 

(l-u(A) if&MeA 
u t (A) = u^u) = l l ) for A e X and 1 < / < n. 

1 n 

Then fi n — \x = — V v\ and 

n 1=1 

— k/2 / k / n \ \ 

EJ n , k {I Al {x 1 )--.I Ak {x k )) = \ r E[Y[ \Y,vi{A m ) . (4.1) 



v m=l \Z=1 



To calculate the expression in (4.1) first we fix some set of indices D = {mi, . . . , m r } C 
{1, . . . , k} of cardinality r together with a number 1 < I < n and show that 

E (xivi{A mt )\ =(r-l)(-iy- 1 f[»(A mt ). (4.2) 
\t=i / t=i 

Indeed, let us consider separately the cases when £1 e A mt for an m t G -D and the case 
when £j ^ A mt for all m t G D, and write 

P( I / J (A mt ) = l- i u(A mt )) 

= P(ui(A mt ) = 1 - /u(A m J, ^(A m J = -At(^m,) if 1 < Q < r and q ^ t) = fJ,(A mt ) 
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r 



for all 1 < t < r, andP(^(A m J = -fJi(A mt ) for all 1 < t < r)) = 1- £ M^mJ- Hence 



t=i 



r 



e n^(^) =i>(^h) n (-/*(An,))+n(-M^)) 

\t=l / i=l \l<g<r, Q^t 9=1 

+ ( 1 " $>(An t ) ) f[(-/*(An,)) = (r - lJC-ir 1 I! ^ A m t )' 
V t=l / q = l t=l 

Take a partition D\,...,D a of the set {l,...,fc} such that \D U \ = r u , 1 < tt < s, 
with some prescribed numbers r M and some integers l u , 1 < l u < n, 1 <w< s, such 
that / u 7^ l u i if w 7^ v! . The random vectors . . . , z/; u (Afc)) are independent for 

u = 1, . . . , s. This fact together with relation (4.2) imply that 

(s \ s / \ s k 

n n ^) =n £ n ^(^) =nK- i )(- i r" _i nM^) 

= (-l) fc_s JJ (r u -1) • • • , rfwi) . . . M rfwfc) 

(4.3) 

with f(u\, . . . , Wfc) = Ia 1 (ui) ■ ■ ■ lA k (uk). We get formula (3.8) together with the con- 
stant defined in (3.9) in the special case we have considered by means of relations (4.1) 
and (4.3) if we observe that by carrying out the multiplication at the right-hand side of 
(4.1) we get a sum with terms of the forms defined in (4.3). (To get a term of the form 
given in (4.3) we prescribe which measures u Uu < ui < n, 1 < I < k, we take, together 
with the partition of the set {1, . . . , k} which tells for which indices / the measure v Ul is 
chosen. We can choose (")s!S(ri, . . . , r s ) such terms, where B(r\, . . . , r s ) denotes the 
number of partitions of the set {1, . . . , k} to subsets of size ri, r 2 , ... and r s .) 

Relation (3.8) also holds for any linear combination of functions of the above 
type. Since the set of such linear combinations is dense in the L\ norm in the space 
(X k , X k , fj, k ) to prove formula (3.8) for general functions / it is enough to show that 

E\J n ,k(f)\ < C(n, k)\\f\\i for all functions f of k variables (4.4) 

with an appropriate constant C(/c,n) < oo. Let us emphasize that since in the problem 
considered here the dimension k of the function / and the size n of the sample £i, . . . , £ n 
are fixed an arbitrary large constant C(n, k) would suffice in (4.4). 

Moreover, relation (4.4) can be simplified. By carrying out the multiplications of 
the measures n n ( duj) — fi( duj) in formula (1.1) we get a sum with finitely many terms, 
and it is enough to prove the bound given in (4.4) for each of these terms. Moreover, 
by carrying out the integration with respect the deterministic measures fx(duj) we get 
that it is enough to show that 

E J |/(ui,..., u^l^idux) . . . fi n (du^) < C(n, k) \\f\\ x 

for all functions / of k < k variables. (4.5) 
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Even this relation can be reduced with the help of the triangular inequality in the Li 
norm and the fact that the linear combinations of the indicator functions of measurable 
sets are dense in the space L\(X k , X k , fi k ). It is enough to prove the following simpler 
statement: For any k < k and measurable set A £ (X h , X k ) 



'A 



^n(du k ) < C(n,k)(x k (A). 



(4.6) 



This statement is obvious, since we can express the integral at the left-hand side of 
(4.6) as n k E J A ji n ( du\) . . . fi n (duk) = n(n — 1) • • • (n — k + l)^ k {A), and this implies 
relation (4.6). The above identity holds, since to calculate the integral at the left- 
hand side of (4.6) we have to count the number of such k tuples (^ , • • • , for which 
, . . . , £j s ) e A and apply the identity P ((^ , . . . , £j fi ) G A) = (i k (A) . (Let us remark 
that at this point we heavily exploited that in the definition of the random integrals 
J n ,k we deleted the diagonals from the domain of integration.) 

To complete the proof of Lemma 1 it is enough to check inequality (3.10). For this 
aim let us observe that because of the factors r u — 1 in (3.9) we may assume in this 
formula that r u > 2 for all 1 < u < s, hence s < [|] , where [x] denotes the integer part 

s 

of the number x. Beside this, we shall apply the rather rude inequality n (r u — 1) < A k 



u=l 



if r u > 1, 1 < u < s and ^2 r u = k. Thus we get that 

u=l 



B n ,k\ = 



_J_ £{-1)*- ( U ) si £ (n - 1) • • • (r s - l)5( ri , . . . , r.) 

s=l ^ ' (n r„) 



(ri,---,r s ) 
rj is integer Vj>l, l<j<s 
SiH hr s =fc 



^ fc!^^" ( " _1) "' (n_S + 1) 



E 



s=l 



B(n,...,r a ). 



(n,...,r s ) 

rj is integer Vj>l, l<j<s 
siH \-r s =k 



(4-7) 

To estimate the above expression B(n, k) let us first show that the inner sum at the 
right-hand side of this expression satisfies the inequality 



E 



B(r u . . . , r s ) < 2 k s k -% for all 1 < s < k. 



(4.8) 



(ri,--.,r s ) 
rj is integer rj>l, 1<j<s 
SiH hr s =fc 



Indeed, the expression at the left-hand side of (4.8) equals the number of partitions of 
the set {1, . . . , k} into s non-empty sets. This can be bounded with the help of the 
following construction: Choose a set {u±, . . . , u s }, 1 < u\ < U2 < • • • u s < k, what can 



19 



be done in (^) way. Let us construct a partition D±, . . . , D 8 of the set {1, . . . , k} so that 
Uj G D s , 1 < j < s, and distribute the remaining k — s points of {1, . . . , /cjamong the 
sets Dj, 1 < j < s. This can be done in s k ~ s way. Beside this ( fc ) < 2 k . We have 
defined no more than 2 k s k ~ s partitions. I claim that we have constructed all possible 
partitions of s elements. Indeed, let us put the elements of a partition Di, . . . , D s in 
such an order that j < j' if the smallest element Uj of Dj is smaller than the smallest 
element uy of Dy. This partition will be obtained if the set {ui, . . . ,u s } is chosen in 
the first step of the above construction, and then the remaining terms are appropriately 
distributed among the sets of partition. 

Relations (4.7), (4.8) and the condition 1 < k < § yield that 



k] \k\ \k\ 



l nMS kln k / 2 ^ ~k k ^ S k k/2 2^{ k J ^ 



k k / 2 

s=i s=l s=l 

with some appropriate constants C > and C > 0. This means that formula (3.10) 
holds. Lemma 1 is proved. 

Now we turn to the proof of the diagram formula described in Lemma 2. Let me 
remark that similar results were already proved for random integrals with respect to 
a Poisson process, (see [8]). These two problems and their possible proofs are similar, 
although in the present problem some additional technical difficulties appear because 
of the absence of independence for the empirical measure fi n of disjoint sets. 

The proof of Lemma 2. Let us consider first the case when f(x±,..., Xk ± ) = Yl (xj), 

g(x±, . . . , Xk 2 ) = 1 1 Ib , ( x j')i the sets Ai, . . . , A^ x are disjoint and the same relation 
i'-i 3 

holds for the sets B±, . . . , B^ 2 . (On the other hand, the sets Aj n By may be non- 
empty. We denote again the indicator function of a set A by Ia(x).) Let us choose a 

parameter e > which we shall let go to zero and consider a partition Aj = [j A(j, s) = 

s=l 

U A(j, s),l<j<k u By = U B{f, t) = [f B{j', t),l<j'<k 2 , of the sets Aj 

s=l t=l t=l 

and By such that /j(A(j,s)) < e, n(B(j',t)) < e, for all 1 < j < h, 1 < j' < k 2 , 1 < 
s < u(j), 1 < t < v(j'), and beside this either A(j, s) = B(j', t) or A(j, s)f]B(j', t) = 
for all pairs (j, s) and (j', t). We can write 



. l (ki+k 2 )/2 fc i 

Jn,kAf)Jn,kM= ' fci!fc2 , Yl HiMAU^Sj^-^Aij.Sj))) 

(si,...,Sfc 1 ) : (ti,...,tfe 2 ) : J = l 
l< Sj <u(j), l<ty<v{j'), 
l<J<fcl l<j'<k 2 



H (p n (B(j',ty))-^B(j',ty))). (4.9) 
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Let B(Af) = B{M, fei, k 2 ) denote the set of all diagrams with first row {1, . . . , ki} and 
second row {1, . . . , k 2 }. We rewrite formula (4.9) with the help of these diagrams by 
putting the products in the sum in (4.9) into certain groups in dependence on the 
behaviour of the indices defining them. Put 

(k 1 +k 2 )/2 

JnMf)Jn,kM = k lk j E Z « (4-10) 

1- 2 ' B(Af)eB(Af) 



with 



Z M = Z M {e) f[ (» n (A(j, 8j )) - »(A(j, Sj ))) 

(si,...,a kl ), (ti,...,tfe 2 ) : J = l 
l<Sj<u(j),l<j<ki, 

i<V<«(j"),i<j"<*2, (4.11) 



n MB(j',t r ))-iJL(Bti',t r ))), 



where means that summation is taken for such pairs of vectors (s\, . . . , s^) and 
(ti, . . . , tk 2 ) for which A(j, sj) = B(j', tj>) if (j, j') is an edge of the diagram B(M) and 
A{j, sj) fl B(j',tj>) = if (j, /) is not an edge of the diagram B(Af). If there are no 
vectors (si, . . . , and (ti, . . . , tfc 2 ) with such a property, then we define Zyv" = 0. 

Let us consider a diagram B{M) G B(Af) with edges A/" = {(ji, ■ ■ ■ , (jl,j'i)} and 
consider an approximation Zjy of Zyy/- which we obtain in the following way: If (j, j') G M 
then we replace the term (fj, n (A(j, Sj)) - fJ,(A(j, Sj)))(n n (B(f ,tj>)) - fi(B(j' ,t r ))) = 

2 1 

(lJ, n (A(j, Sj)) — ^(A(j, Sj))) by —fji n (A{j, s)) and do not change the other terms. (The 

following heuristic argument may lead to the introduction of such an approximation. 
The expression (fj, n (A(j, Sj)) — [i{A{j, Sj))) 2 can be well approximated by fj, n (A(j, Sj)) 2 , 
the other terms being negligible. On the other hand, fj, n (A(j, Sj)) 2 ~ ^fj, n (A(j,Sj)), 
since the probability of the event that at least two sample points £j fall in a set A(j, Sj) 
of small /it measure is negligibly small. Hence //^(A(j, Sj)) — —fx n (A(j,Sj)) = with 
probability almost 1.) More explicitly, we define Zj^ for all B(Af) G B(Af) as 

Z M = Z M {e)= II (Vn(A(j, Sj ))-fi(A(j, Sj ))) 

(si,...,s fcl ), (t!,...,t k2 ) : j: jE{l,...,k 1 }\{j 1 ,...,ji} 
l<Sj< U (j), l<J<fei, 
l<t jf <v(j'),l<j'<k 2 

II (-Vn(M3m,S jm )) 
m=l ^ 

(^(B(/,V))-M5(/,V))), (4-12) 

j': j'i {1 A : ,; [j' ./;] 
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I claim that for all B{M) G B{M) the inequality 



\Z M {e)-Z Ar {e)\<C{k 1 + k 2 ,n)e (4.13) 

holds with some constant C(k\ + k2,n) < oo which does not depend on e. We do not 
need a good estimate for this constant. What is important for us is that relations (4.10) 
and (4.13) have the consequence 

(fei+fe 2 )/2 

J n , kl (f)Jn,kM = ^ Q , ,, , E ( 4 - 14 ) 

£ ^ 1- 2 ' B(N)eB(N) 

Before proving relation (4.13) let us show that formulas (4.14) and (4.12) imply relations 
(3.11) and (3.12) in the special case we are considering. To do this we express Zj^(e) 
(which as we shall see actually does not depend on the parameter e) in an appropriate 
form. To do this let us write the terms in the middle product of (4.12) in the form 

^n(A(j m ,s jm )) = ±((ii n (A(j m , Sj J)- v(A(j m ,s jm ))) + v(A(j m ,s jm ))), and let us 
rewrite the terms Zjy as 

Z/V = n " / II R 3tJ'J ° 9(X1, • • • , X kl+k2 ) Yl (Vn(dXj) - n( dXj)) 

*=1 /: r { ■ '• : I !./: // ! 

Y[ {Hn{dx r ) -fi(dxj')) (4.15) 

./": j't ( 1 !■ >.} !/' /H 

I 

Yl {{dn n {x jB ) - dn{x js )) + dn{x js )) , 

S=l 

where ■ ■ ■ are the edges of the diagram B(j\f), and the function fog and 

operators Rj tj f are defined in formulas (3.4) and (3.3). 

Given a diagram B(j\f) let us introduce the class of coloured diagrams C(B(J\f)) 
consisting of all coloured diagrams B(J\f,J\fi) we obtain by colouring the edges of the 
diagram of B(j\f) by +1 or — 1. By carrying out the multiplications in formula (4.15) by 
expressing the last line of this expression as the sum of dfx n (xj s ) — dfi(xj s ) and dfi(xj s ) 
we can rewrite Z_\f with the help of the coloured diagrams in C(B(j\f)) as 

^ -i(h + k 2 - 1 - p)l ,- . 

Z ^~l^ 1^ 71 n (fci+fc 2 -i-p)/2 JnM+k2-l-p{J °9 B {M,N,)) 

P=iBCA/Wi)ee(B(-AO), 

and |A/*i \=p 



The last relation together with formula (4.14) yield that 

min(fei,fc 2 ) / 

n('-f)/ 2 /ci!/c 2 ! 



Jn,kAf) J n,kA9) ~ EE E ~ lM-p)/2h Ifr^ J n,k 1 +k2-l- P (f°9B(^Mi))- 



i=o P =o B{M,Ni)eB{i, P ) 
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This relation is equivalent to the diagram formula described in formulas (3.11) and 
(3.12). Hence to prove Lemma 2 in the special case considered now it is enough to 
prove formula (4.13). 

First we formulate an inequality and show that formula (4.13) follows from it. To 
formulate it let us introduce the following five (random) functions p( m \ 1 < m < 5, 
defined on the measurable sets A G X of the space (X, X). Put p^'(A) = p, n (A), 
p(*\A) = —fJ>(A), pW(A) = -2vL(A)fr{A), pW(A) = p{Af and p^\A) = p n {Af - 
—fji n (A) for all A G X. Let us fix some positive integer r and r disjoint subsets Cj G X 
of the space X, 1 < j < r, such that p.(Cj) < e for all 1 < j < r with some r > 1. Let 
us choose some sequence of integers m(j), 1 < j < r, such that 1 < m(j) < 5 for all 
1 < j < t and m(j) > 3 for at least one of these indices. I claim that 

r r 

Slj|p mW) (Ci)| <eC(r,n)IjMCi) (4.16) 

J=l 3=1 

for the above defined functions p^ with some appropriate constant C(r, n). (One could 
get rid of the dependence of n in the constant C(r, n) but such an improvement of the 
inequality (4.16) has no great importance for us.) 

First I show that relation (4.13) can be proved with the help of the estimate (4.16). 
For this aim I rewrite the expressions Zjsf and Zjsf with the help of the above defined func- 
tions p( m ^ . To get a good representation let us first consider those pairs (j, j') for which 
(j, j') G N and write the terms of the form (fi n (A(j, Sj)) — p,(A(j, Sj)))(fi n (B(j' , tj>)) — 
p,(B(j', tj'))) = (fi n (A(j, sj)) — p,(A(j, Sj))) 2 in the expression (4.11) with such indices j 
andj' by means of these new functions as p( 5 \A(j, Sj)) + ^p^(A(j, Sj))+p( 3 \A(j, Sj)) + 
p^(A(j, Sj)). Similarly, the terms ^p n (A(j, s)) in the expression (4.12) can be rewrit- 
ten as - p^\A(j , s j)) . If j is not an end-point of an edge in the first row of the 
diagram A/", then let us rewrite the terms of the form p n (A(j, Sj)) — p(A(j, Sj)) as 
p^\A(j, Sj)) + p( 2 \A(j,Sj)) both in formula (4.11) and (4.12). Similarly, if j' is not 
an end-point of an edge in the second row of the diagram A/", then let us rewrite the 
terms of the form p n (B(j' ', tj>))- fi(B(j' , tj>)) as pW(B(j', t J /))+p (2 )(S(j / , t r )) in these 
formulas. 

Let us consider the difference Zj^f — Z^f rewritten in the above form as the function 
of the terms p m ^\A 3 , Sj ), l<j< hand p m ^'\B r ,t r ), f G {1, . . . , k 2 } \ . . . , 
In such a way we can express Zj^ — Z^ as the linear combination of some terms with 
coefficients between zero and 1 which have the form 

fci 

Ylp m ^(A(j, Sj )) p^'\B(j',t r )), (4.17) 

s=1 j'e{l,...,k 2 }\{3[,-,3[} 

where . . . is the set consisting of the end-points of the edges of the diagram 
B(N) in the second row. Beside this, it follows from the above construction that in 
the terms described in (4.17) m(j) > 3 for at least one of the indices ji, . . . ,ji, where 
ji, . . . ,ji are the end-points of the edges of the diagram j\f in the first row. Since by 
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our assumption the inequalities n(A(j,s)) < e and p,(Bj/,t) < e hold for all terms we 
consider, formula (4.16) can be applied with the choice r = k± + k 2 — I to estimate all 
terms of the form (4.17). It yields that 

E l[p m(j) (A(j, Sj )) J] p™(i'\B(j',t r )) 

s=1 j'6{i,...,fc 2 }\{j;,...,j;} 
fci 

< sC(k 2 , n) J] p(A(j, Sj )) J] p(B(j', t r )) 

s=1 j'6{i,...,fc 2 }\{j;,...,j;} 

n au, Sj ) x n 

»=i j'e{i,...,fc 2 }\{ji,...,j l '} 

where p k ^+ k ^~ l denotes the /ci + /?2 — /-fold direct product of the measure p with itself 
on the space (X kl+k2 ~ l , X kl+k2 ~ l ), and the product at the right-hand side of (4.18) 
denotes the direct product of the corresponding sets. 

The estimate (4.13) follows from inequality (4.18) if we sum up this estimate for 
all such terms of the form (4.17) which appear in the difference Zjv — Zjy when it is 
written in the above described form. We can see this if we observe that any two sets 
A(j, s) and B(j', t) appearing in one of these expressions are disjoint unless they are 
both functions of the same type A(-, •) or B(-, •), and their arguments (j, Sj) or (J', tj>) 

fci 

agree. This fact implies that those sets Yl A(j,Sj) x fj B(j',tj') at 

»=i ' j>e{i,...M}\{j[,--;j[} 
the right-hand side of (4.18) whose p k ^+ k ^~ l measures have to be summed up to get an 
estimate for the difference E\Zj^ — Zj^\ are either disjoint or they coincide. Since each 
term appears less than 5 kl+k2 times in one of these estimates this means that formula 
(4.16) implies formula (4.13) with the choice C(ki + k 2 , n) = 5 kl+k2 C(ki + k 2 , n). 

To prove formula (4.16) let us express the functions p^'(Cj), p^{Cj) and p^\Cj) 
with the help of the measures v\ defined as 

ri \iii(uj)^A 

ui{A) = vi(A,u) = { ; for all A e X and 1 < / < n. 

In O n 

We can write p W (Cj) = - £ M<^)> P^\Cj) = —p(Cj) £ ^(Cj), and 

n h =i n lj=1 



n / n \ 

^ (5) (^) = -2 E - - E ^Cj) ) =--2 E 

lj=l \ lj=l J l<I,-,j;-<n,I,-?4I,-/ 



Let us observe that only pairs (ljjj>) with lj ^ ly are considered at the right-hand side 
of the last formula, since i>i{C) — vf{C) = for all C G X. 
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Let us insert these formulas to the left-hand side of formula (4.16), carry out the 
multiplications, and let us bound the absolute value of the sum obtained in such a way as 
the sum of the absolute value of these terms. All these terms are products of expressions 
of the form Pij(Cj), Pi.,(Cj) or n(Cj). Let us observe that only those products have 
to be considered for which the indices L and L> of the measures v\, and v\., are all 
different for different indices j. Otherwise the product we consider equals zero. We 
can see this by observing that the sets Cj are disjoint. Therefore if lj 1 = lj 2 for some 
ji 7^ J2, then either &i. (C^) = or Pi j2 (Cj 2 ) = 0. On the other hand, the measures 
Pi are independent for different indices /, and this implies that the terms we have to 
bound are the products of r independent random variables (some of these variables may 
be constant). Since the sum we have to bound contains less than n r terms to prove 
(4.16) it is enough to show that the expectation of the absolute value of these terms can 

r 

be bounded by const. £ fj fJ>(Cj). Since Evi^Cj) = Evi,{Cj) = /z(C 3 -), the product 

whose expected value has to be estimated is the product of r such independent random 
variables whose expected values can be bounded fJi(Cj), 1 < j < r. Moreover, there is 
at least one index j for which m(j) > 3 in the expression (4.16). This property together 
with the condition fJ>(Cj) < e imply that the product whose expected value we have to 
estimate contains at least one factor whose expected value is less than 2efi(Cj). This 
means that the inequality needed for the proof of relation (4.16) holds. 

Let us observe that both sides of formula (3.11) is a bilinear form of the functions / 
and g. This fact implies that it holds not only for those pairs (/, g) of functions we con- 
sidered before, but also for any finite linear combinations of such functions / and g. Such 
linear combinations are dense in the spaces L2(X fcl , X kl , fi kl ) and L2(X k2 , X k2 , (i k2 ) 
respectively. Hence to complete the proof of Lemma 2 it is enough to show that if a 
sequence of functions f n and g n satisfy the relations ||/ n — f\\2 — > and \\g n — g\\2 — > 
with some functions / and g, then 

lim E\J nkl (f n )J nk2 (g n ) - J nk (f)J n ,k 2 (9)\ = (4.19) 

n^oo 

and 

lim E\J n k 1 +k 2 -l-p(fn ° 9n)B(Af,Afi) ~ ^n,fci+fc 2 -/-p(/ ° 9)b(M,Mi) I = (4.20) 
n— »oo 

for all coloured diagrams B{N,N\) G B{l,p), < p < I < min(/c!, k 2 ). 

To prove relation (4.19) it is enough to prove that E^n^ifn) ~ Jn,k 1 (f)) 2 = 
E{{J nM {fn ~ f)f - if ||/„ - f\\ 2 - 0, and E(J nM (g n - g)f - if ||<7 n - g\\ 2 - 0. 
Let us prove the inequality E(J n ^(h)) 2 < C(n, ki)\\h\\2 with some constant C(k, n) for a 
function h of k variables. The relation we want to prove is a consequence of this inequal- 
ity. On the other hand, this inequality can be deduced from some arguments presented in 
the proof of Lemma 1. Indeed, similarly to the argument presented there we can reduce 
the statement to be proved to the inequality f h 2 (xi, . . . , x/ c ) / u n ( du\) . . . fx n ( du k ) < 
C(n, k)\\h\\2 with an appropriate constant C(n, k) for any function h of k < k variables 
such that \\h\\\ < oo. But this statement is contained in formula (4.5). We only have 
to apply it for the integrable function h 2 . 
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Relation (4.20) can be deduced from the results proved at the end of Lemma 1 in 
an even simpler way. Because of the results proved there it is enough to check that 

\\{fn° 9n)B{N,Ni) ~ (/ ° 9)b{N,Mi)\\i ~* if ll/r» ~ /111 ~* and \\9n ~ g\\l ~> °- 0n the 

other hand, it follows from the Schwarz inequality and the contraction property of the 
operator P defined in (3.2) in the L2 norm that 

\\U °9n)BMMi) ~ U 9)b{M,Ni)\\i = \\(f ° (9n~ g))B(M,Afi)\\l - H/lllllS ~ 0n||l» 

hence ||(/ o g n )B{M,Mi) ~ (/ 9 )b{M,Mi) Hi ~* °- Similarly, 

|| (/n °9n)B{MMi) - {fn°9n)B{MMi)\\ 1 ~* °- 

These inequalities imply relation (4.20). Lemma 2 is proved. 

Lemma 3 which gives an estimate on the second moment of a random integral 
Jn,k(f) can be proved as a consequence of Lemmas 1 and 2. 

The proof of Lemma 3. By Lemma 2 we have 

ej «m? = EE jj^F^g n ' v '' ) " EJ '- a -'-' u ° 

with the functions / 'o fi p defined in formula (3.12). Hence by formulas (3.8) and (3.10) 
in Lemma 1 

ft m 2 <V>T (2k-l- P )\ (l _ p)/2 C^-v 

*Jn,kAS) S 2^2^ {k _ /)!2(/ _ p)lp] n {2 k-l- p)(*k-i-p)/* V UAl 

(4.21) 

with some constant C > 0. We claim that ||/ o // )P ||i < H/Hl- Since / o fi p is the 
average of certain functions of the form / o fB(N,Ni) with some diagram B(Af, A/i) it is 
enough to show that ||/ o /s(jv,ATi) II l < ll/lli f° r an arbitrary diagram B(J\f,N\). We 
get similarly to the proof of formula (3.15) by considering first functions of the form 
/ ° f B(N) defined in formula (3.5) and applying the Schwarz inequality for them that 

f°fB{N) 1 < J f 2 (x 1 ,...,x k )fx(dx 1 )...fx(dx kl ) = 



for an arbitrary diagram M. Since the operators Pj defined in formula (3.2) are con- 
tractions also in L\ norm, the last inequality implies that the relation ||/o fB{M,M\) II 1 = 
ll/ /s(Ar,M)lli < II/°/b(aoIIi - ll/lli holds > as we claimed. By formula (4.21), the 
inequality ||/ o // jP ||i < \\fW2 and the Stirling formula we get that 



EJnMff < C> t E ( f! y ^-PY-V """" P>/2 11/111 <422) 
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with a possibly new constant C > 0. 

Since the right-hand side of (4.22) contains less than k 2 terms and n > k this 
relation enables us to reduce the proof of Lemma 3 to the estimate 

(2k -i- p )( 2k ~ i -p)/ 2 < c k (k - if^- l \i - P y- p p p k^- p) / 2 k- k 

with an appropriate C > for all < p < I < k. Since (k — l) h ~ l p p (l — p) l ~ p > 
3~ k k k (this follows from instance from the convexity of the function xlogx if we take 
logarithm), to prove the last inequality it is enough to show that 

( 2 P ) <C k {k-l)^k^' 2 if0<p<l<k 

with some C > 0. By dividing both sides of the last inequality by k^ 2k ~ l ~ p ^ 2 and after 
this taking the k-th root of both sides we get that the last inequality is equivalent to 
the statement 

ou j x (2k-l- P )/2k /h ,x (k-l)/k 

TO ^(V) ifO< P <i<.. (4.23) 

Inequality (4.23) holds. This inequality holds since the left-hand side can be bounded 
by sup u u < 1 from above, and the right-hand side by C inf u u > Ce~ x l e from 

0<u<l 0<w<l 

below. Lemma 3 is proved. 

The proof of Lemma 4- Let us first investigate the main result of Lemma 4 which states 
that the function / o g is (ri + — (I — p) , a 2 ) dominated under appropriate conditions. 
Let us fix a coloured diagram B(J\f,Mi) = B(N,Ni, fei, k 2 ) with 

Af={(ji,j[),...,(ji,j' l )} andM = {C7„(i),^ ( i ) ),...,(j„(p),^ (p) )}, (4.24) 



where {w(l), . . . , C {1, . . . , /}. It is enough to prove that the function / o gB(j\f,Ni) 

defined in formula (3.6) is (ri + r 2 — (/ — p), a 2 ) dominated. We have certain freedom 
to change the order of the operators Rjj> and Pj in the definition of this function. By 
exploiting this freedom we can rewrite the function / o gs{M,Mi) as 



/ ° fi , B(AT,M)( ;z: s(l), • • • , ^s(fci+A; 2 -Z-p)) 

bf p (4.25) 

s=l t=l 

with {u(l), • • .,v(l-p)} = {1, . . .,/} \{u(l), • ..,u(p)}, i.e. the operators Rj vM j' are 
indexed by the edges 

M=M\M 1 = {tiv(i)Jv i i ) ),---,(Jv(i- P )J' v( i- p) )} (4-26) 
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of the diagram B{M,Ni). Beside this, we can choose the order of the operators 
Rj v(s) j' in an arbitrary way. Let me recall that in our notations the operators written 
after each other are applied in right to left order. 

Lemma 4 will be proved with the help of an inductive procedure described be- 
low. To carry it out let us introduce some notations. Let us consider a function 
F(x 8 (i), . . . , x s ( m )) with the set of indices S = {s(l), . . . , s(m)} C {1, . . . , k\ + k 2 } 
together with a diagram B{M) = B(M, S) consisting of two rows {s(l ),..., s(r)} and 
{s(r + 1), . . . , s(m)}, where the index r is defined by the relations s(r) < k\ < s(r + 1). 
Beside this, the diagram B(Ai, S) has some edges M. = {(ji, ■ ■ ■ , (j q , j' q )} with 
< q < min(r,m-r), 1 < ji < ■ ■ ■ j q < s(r), {ji,...,j q } C {s(l), . . . , s(r)}, 

{ji>---Jq} c i>( r + !)»•••> and J« ^ Ju' if u ^ 1 < < 9- We in- 

troduced the diagram _B(.M, £) to indicate for which pairs we want to apply the 

operator Rjj/ for the function F during the inductive procedure leading to the proof of 
Lemma 4. 

Let us call a partition Si, ... , B r , B u = {s{d(^ u ^), . . . , s(d( UjW ( u )))}, 1 < u < r, of 
the set £ = • • • , s(m)} together with a set of functions hi, . . . , h r , such that the 

function h u = h u (x s (d( U ■ ■ ■ i x s(d (u depends only on the variables indexed by 

the elements of the set B u for all 1 < u < r, and ||ft u ||oo ^ 1) ll^ulli — a2 > 1 < w < r, 
an (r, a 2 ) dominating system of the function F if also the relation 

r 

\F(x s (i), . . . , x s{m) )\ < Yl h u (x s ( d(ul)) , . . . , x s{d(uv(u)) ^j for all x s{l) £ X, l<l<m 

u=l 

holds. Let us call this (r, a 2 ) dominating system regular (with respect to the function 
F and the diagram B(A4) attached to it) if the two end-points ji and j[ of the edges 
of the diagram B(M) are contained in different sets B u ^ and B u >^ of the partition 
Bi, . . . , B r for all 1 < i < q. We call an (r, a 2 ) dominating system super-regular if 
all elements Bi, . . . , B r of the partition of this system are contained either in the set 
{1, . . . , ki} or in the set {ki + 1, . . . , ki + A^}. A super-regular (r, a 2 ) dominating system 
is clearly regular with respect to any diagram B(M), since all edges of a diagram B(M) 
are going between the vertices of the sets {1, . . . , ki} and {ki + 1, . . . , ki + £2}. 

It follows from the conditions of Lemma 4 that the function / o g(xi, . . . , Xk 1 +k 2 ) 
defined in formula (3.4) has a super-regular (ri +r 2 ,cr 2 ) dominating system. Indeed, 
the function f(xi, . . . , x^) can be dominated by a partition Bi, . . . , B ri of the set 
{1, . . . , ki} together with some functions hi , . . . , h ri such that the indices of the ar- 
guments of the function h u are contained in B u , 1 < u < ri, and the function 
g(x kl+ i,...,x kl+k2 ) by a partition B ri+U . . . , B ri+r2 of the set {ki + 1, . . . , k x + k 2 } 
together with some functions h ri+ i, . . . , h ri+r2 such that the indices of the arguments 
of the function h u are contained in B u , r± + 1 < u < r± + r 2 . Beside this ||/i u ||oo < 1 
and || h u || \ < a 2 for all 1 < u < r± + r 2 - The union of these sets and functions yield a 
super- regular (ri + r 2 , cr 2 ) dominating system of the function fog consisting of the sets 
Bi, . . . , B ri+r2 and the functions hi, ... , h ri+r2 . 

We shall prove Lemma 4 with the help of the above observation, formula (4.25) 
and some properties of the operators Pj and Rj,j>. First we show by induction that 
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the function J] P jt R juj >J o g(xi, . . . , ^fc 1+fc2 ) = / o </b(M,M) also has a super-regular 
(ri + 7*2, cr 2 ) dominating system. 

We shall prove the following statement which implies the above relation. Let a 
function G(x Sl , . . . , x Sm ) have an (r, a 2 ) super-regular dominating system and take two 
indices 1 < j, < fa < j' < fa + k 2 such that {j,j'} C {si, . . . , s m }. Then the function 
PjRj j'G also has an (r, a 2 ) super-regular dominating system. This statement enables 
us to show with the help of a simple induction that together with the function fog 



also the function / o ^b(M,M) = II PjtRjtj'f ° # nas a super-regular (ri + r2,cr 2 ) 
dominating system. 

To prove the above statement let us consider a super-regular (r, a 2 ) dominating 
system of the function G consisting of a partition B\, . . . , B r of the set {si, . . . , s m } and 
some functions hi, 1 < / < r, depending on the arguments indexed by the set _£?/;. To 
construct a super-regular dominating system of the function PjRjjiG let us introduce 
the following notations. Let u\ and u 2 denote those indices for which j e and 
j' G .B U2 , and let us denote the elements of these sets as B Ul = {j, v(l), . . . , v(m)}, 
B U2 = {j', v'(l), . . . , v'(m')}. Then we define B u = B u , h u = h u if u ^ u\ and u ^ u 2 , 
B Ul = B Ul \ {j}, B U2 = B U2 \ {j'}, and 

,1/2 

h U2 { x v'(l), ■ ■ ■ , x v'(m')) = ( / ^(^'''^'(l)' • • • i x v'{m'))^{dZji) 

if u = u\ or u = u 2 . We claim that the above defined sets Bi and functions hi, 1 < I < r, 
supply a super-regular (r, a 2 ) dominating system of the function PjRj jiG. To show this 
let us first observe that because of the Schwarz inequality 

l-^j'-^Jjj'^wi ( X ji x v(l)i • • • i x v(m))h'U2 i x j' > ^'(l) 5 • • • ? x v'(m') ) I (4 27) 

< . . . , x v ( m ^)h U2 (x v /(ij, . . . , 3V(m'))- 

Since the product of the functions ft/, 1 < I < r, gives an upper bound of the function 
G, formula (4.27) and the definition of the functions hi imply that the product of 
the functions hi yields an upper bound for the function PjRj jiG. The functions h Uj , 
j = 1,2, satisfy also the relation \\h Uj || 2 = \\h Uj || 2 < a 2 . It is not difficult to see with the 
help of these relations that the sets Bi and hi, 1 < I < r, yield a super-regular (r, a 2 ) 
dominating system of the function Rj.yPjG. 

Thus we have proved that, with the notation (4.24) and (4.26), the function 
/ ° fi , s(AT 1 ,M)( ;r s(i) 5 • • • > x s(k 1 +k 2 -2 P )) together with the diagram B(M) attached to it, 
where M = Af\ M = {(j v (i), f v (i)), • • • , Uv(l-p)J' v (i- p -))}, and the indices of the argu- 
ments of the function / o gB(Mi,Mi) are 

{s{l),...,s{fa -p)} = {1, ■ ■ ■ , fa} \ {j u{1) , . . . ,j u{p) }, 

{s(fa -p+l),...,s(fa + k 2 - 2p)} = {fa + 1,. ..,fa + k 2 }\ Ou(i)»-- -ijuip)}' 
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has a regular (ri + r 2 , a 2 ) dominating system. We want to prove that the function 



i-p 

/ ° 9b{MMi) = II R Mm),f v(m) f ° »s(M,M) ( 4 - 28 ) 

m=l 

has an (r± + r 2 — (/ — p), a 2 ) dominating system. Let us recall that \M \ = I —p and the 
operators R~ , s ,•/ considered in formula (4.28) are commutative. 

The above statement will be proved by the successive application of a statement 

formulated below. Let a function G(x s ^, . . . , x s ( m )) and a diagram B(M) attached 

to it have a regular (r, a 2 ) dominating system consisting of some partition Bi, . . . , B r 

of the set {s(l), . . . , s(m)} and some functions hi, 1 < I < r, depending on variables 

indexed by the set Bi. Let us consider two sets Bi and By such that there are edges of 

the diagram M whose end-points are in the sets B\ and By . By a new enumeration of 

the sets of the partition if it is needed we may assume that / = r — 1 and V = r. Let 

Mo = {(s(l), s(l)'), . . . , (s(q), s(q)')}, q > 1, denote the set of all edges of M which 

connect some points of the sets B r _\ and B r . Here we do not specify which element 

of a pair (s(t), s(t)') is in the set B r -\ and which element is in the set B r: 1 < t < q. 

q 

We claim that the function Yl R s (t),s(t)'G together with the diagram B(M \ Mo) has 

t=i 

a regular (r — l,cr 2 ) dominating system. (Let us recall that the diagram attached to 
the function G has the role to list the pairs (j,f) for which we want to apply the 
operator Rjj>.) More explicitly, we can get a regular (r — l,a ) dominating system 

(£?;, hi), 1 < I < r — 1, in the following way. Put Bi = Bi and hi = hi, 1 < / < r — 2, 

<? 

S r _i = -B r _i U B r \ {s(l)', . . . , s(q)'} and /i r _i = j] R s ^ s ^ t yh r -ih r . It is rather 

t=i 

straightforward to check that the above sets Bi and functions hi, 1 < I < r — 1 constitute 
a regular (r— 1, a 2 ) dominating system. The only statement which demands some special 
consideration is the inequality ||/i r -i||2 ^ °" 2 - We can see with the help of the Schwarz 
inequality (similarly to the corresponding argument applied in the proof of formula 
(3.14)) that 



UK- 



l|2 

1II2 



II Rs(t),s(tyh r -ih T 



t=l 



< \\hr-ih\\h r \\2 < o 
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By applying the above fact for the function at the right-hand side of (4.28) s < I —p 
times we get that the function f ° gB{N,N±) ls ( ri + r 2 — s ^ 2 ) dominated with some 
s < I— p. (At each step we take into consideration at least one new operator Rj v{m) ,j' { 

in formula (4.28), so the procedure finishes in s < I — p steps.) This property implies 
that the function f ° 9b{N,M-l) ^ as an ( ri + r 2 ~ s i°~ 2 ) dominating system with some 
number s < I — p. To give a complete proof for the main result of Lemma 4 let us 
observe that in the case of a 'too good estimate' when s < I — p we can construct an 
(r, a 2 ) dominating system of the function / o gB(M,Mi) w ^ n r = ri + r 2 — (I — p) . We 
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get such a system by taking the union of some sets B[ of an (r, a 2 ) dominating system 
if f > r, and by considering the product of the functions h\ corresponding to them. 

Finally, the last statement of Lemma 4 is a simple consequence of formula (3.14). 
Indeed, if / is (ri,a 2 ) and g is (r2,o" 2 ) dominated, then \\f\\ 2 < o" 2r \ \\9W2 — a2r2 > 
hence ||/ o gB(jv,ATi) III — & ri+T2 by relation (3.14). On the other hand, ||/||oo < 1 and 
\\g 1 1 00 < 1, hence ||/ o gB(M,Ni) 1 1 00 < 1- These inequalities mean that / gB(M,Ni) is an 
( 1(T (ri+r 2 )/2) dominated function as we claimed. 



5. The proof of Proposition 2 together with a technical lemma 

First I prove Proposition 2 with the help of Lemma 5 formulated in Section 2. Then I 
also prove Lemma 5. 

The proof of Proposition 2 with the help of Lemma 5. By Lemma 3 formula (3.17) holds 
for m = and all k = 1,2,... with the choice C(/c, 0) = C k with some C > 0. Let 
us assume that it holds for some m and all k > 1 with appropriate norming constants 
C(/c,m). Then I prove it by induction for m + 1 with appropriate norming constants 
C(k,m+ 1). 

To do this let us fix some function f(x±, . . . , x k ) of k variables, consider the functions 
/ o fi v defined in formula (3.12) and give a good bound for the 2M = 2 m+1 -th moment 
of the random integrals J n ,2k-i- P (f fi, P )- Then the AM = 2 m+2 -th moment of the 
random integral J n ,k(f) (in the case n > 2kM) can be estimated by means of these 
bounds and the diagram formula. 

Let / = f(xi, . . . ,Xk) be a function of k variables, k > 1, which is (r, a 2 / r ) domi- 
nated with some r > 1. I show with the help of Lemma 4 and formula (3.17) with the 
parameter M = 2 m that the inequality 

/ -(l-p)/2j /, , \2M / C(2k-l- P ,m) \ M ((9M ,k 2\2M 

E[n Jn,2k-i- P (l h,p) ) < [j2k - / - pfk-i- P ) U 2M ) a ) 

-2kM ( / 2kM x 2Mmin ( fc > r ) x ' ! 



max 1, —— , 0<p<l<k 



(2k)( l ~p) M M 2lM I ' \na 2 / r 

holds if 2kM < n. This formula also holds in the degenerated case p = I = k with the 
choice C(0, m) = 1. Indeed, in this case 2k — p — I = 0, hence / o f k k is a constant, 

and EJ nj o(f o fk,k) 2M = (f fk,k) 2M < (a 2 ) 2M . The last inequality follows from the 
definition of the function / o f k k = f o fB(U,M)i \^\ = given in formula (3.12) and 
the estimate \f o fB(M,Ni)\ — II /II 2 ^ °" 2 if A/" = A/"i, |A/"| = k, (k is the number of 
variables of the function /). The last inequality follows from relation (3.15). These 
relations imply that \ f o f k ^ k \ < a 2 . We had to consider this case separately, because 
our inductive hypothesis (3.17) is formulated only for functions with k > 1 variables. 

The integral J n ,2k-i- P (f fi, P ) is the average of the random integrals J n ^k-i- P (f 
fB{N,Ni)) witn tne functions fB{M,Mi)i x ^ • • • ' x 2k-l- P ) such that B{J\f,N\) G B(l,p). 
Hence the triangular inequality for the L2M norm in the probability space (O, A, P) 
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where the random variables £j are living implies that it is enough to prove such a version 
of formula (5.1) where the random integral J n ,k(f ° fi,p) a t the left-hand side is replaced 
by an arbitrary random integral J n ,2k-l-p(f ° /b(a/",M)) sucn that B(H,Hi) e 13(1, p). 

Let us apply formula (3.17) for a random integral J n ,2k-l-p(f ' ° J b(M ',M)) sucri that 
B(H,H\) G 13(1, p). (In this case (2k — l—p)M < n, and our inductive hypothesis allows 
the application of formula (3.17).) Let us consider the cases 2r > / — p+ 1 and 2r < l—p 
separately. In the first case the function / o Jb(M,Ni) ls ( 2r ~~ ^ ~^Pi °" 2 ^ r ) or in other 
notation (2r — I + p,a 2 ^ 2r ~ l+p ^) dominated with a = a ( - 2r ~ l+p ^ r by Lemma 4. Let us 
apply relation (3.17) with the choice k = 2k — / — p, f = 2r — I + p and a = a < - 2r ~ l+p ^ r 
in this case. Observe that cr 2 / r = a 2 / r . We get that 

( _ ( ,_ p)/2 , , X 2M ^ C(2k-l-p,m) \ M , k 2 .2M 

E(n' "» Jn,2k-l-p(fof B {MMi))) < [ j2k - I - p )2k-i-p J {( 2M > a ) 

2~2kM ^ / / 2fcM \ Mmin ( 2fc_Z ~P' 2r ~'+P)' 

• M (Z+ P )M ( n(T 2/r)(Z-p)M I (^T^J 



C(2k — l—p,m) \ M //r,7i /f \fc 2\ 2M 



((2M)V)' 



(2fc - I - pfh-l-p 

2 -2fcM / / 2A;M \ (i-p)M / 2A;M X M min{2k-2p,2ry 

' M 2ZM( 2 fc)(i-p)M maX ^ [j^F ) \^F) 

M 



( C(2k-l-p,m) \ ((0M ,k2^M 
~ {(2k - I - p^-i-P ) « 2M )") 



2~2kM / / 2 i;_/Vf \ 2Mmin ( fc ' r ) 



max !. 37277- > ( 5 - 2 ) 



M 2lM (2kY l ~P) M \ ' Vna 2 / r 

since the exponents (/ — p)M and min(2/c — 2p, 2r)M in the last but one inequality of 
formula (5.2) satisfy the inequality < (/ — p)M < min 2M(/c, r), and < min(2/c — 
2p, 2r)M < min2M(/c, r). 

In the second case when 2r < / — p we know by Lemma 4 that the function / o 
fB(N,Ni) * s (l' " 2 ) dominated. In this case we apply formula (3.17) with the k = 
2k — I — p, f = 1 and a = a. We get that 



2~2kM | 
M (l+ P )M a 2M n {l-p)M maX 

M 




2 -2fcM // x \M , 2A , M 



max 



MV+V)M I V n (l-p) a 2 I ' \ n i-p+l cr 4 
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I claim that 

/ 1 2kM \ 1 ( {2kM\ 2r \ 

Relation (5.4) together with formula (5.3) and the relation k > r if 2r < I — p yield that 
in this case also the inequality 

E (n-«-^J. mU o f BWM)) y M < ( ( ^-' p p-Tl ) " (WW™ 

max 1, — ^j- j (5.5) 



M 21M (2k)V-P)M \ ' V?l(T 2 / r 



2M 



holds. Relations (5.2) and (5.5) give a bound on E {n~^~ p ^ 2 J n ,2k-i-p(f b{N ,Ni))) 
in both cases 2r > / — p + 1 and 2r < / — p, thus they imply formula (5.1). Hence to 
complete the proof of formula (5.1) it is enough to check relation (5.4). 

Actually I prove a stronger inequality than formula (5.4). I show that if the in- 
equality 2kM < n holds, and this was assumed among the conditions of relation (5.1), 

1 / 2kM N " ; 



then the left-hand side of (5.4) can be bounded by . — — ttt i i-e. by the 

second term of the maximum at the right-hand side of (5.4). This follows from the 
following estimations: 



n 



1 _ 1 (2kM\ ir / n \2r-(i- P ) 2 1 (2kM\ 



and 



2kM [2kM\ 2r 1 / n \2r-(z- P )-i ( 2kM\ 2r 1 



rr 



1 (J?-) 

M) l ~P \2kMJ 



< 



i-P+i a 4 \na 2 / r J (2kM) l -P \2kM J ~ \na 2 / r J (2kM) l ~p' 



since 2kM < n, 2r - (I - p) < and a 2 < 1. 

The diagram formula (3.11) together with the triangular inequality in the L2M 
norm yield that 



EM/)™ = b (e e (t ( ")p ( 'r-p)'ipi "" ( '" rt/ ' ■ j - 2 '-'- p(/ /, " ,) 



2M 



< 



(5.6) 
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Relations (5.1) and (5.6) imply the inequality 



E(J n Mf) 4M <((2M) k a 2 ) 2M m^U( 



2kM x 2M min ( k ' r y 



na 2 / r / 

2M (5-7) 

x ^ x ^ K *k-l-p)\ ( C{2k -l-p,m) y (2k)-^-P) /2 x 
T~^p~o {k-l)\ 2 {l-p)\p\ \{2k - I - pfk-i-v J 2 k -M l 

for the random integral J n ,k(f) of an (r, a 2 / r ) dominated function /. By writing the 
term M l in the form 2 ml in (5.7) and comparing formulas (3.17) and (5.7) we can see 
that relation (3.17) also holds for the new parameter m + 1 with some new (positive) 
constants C(k, m + 1), k = 1,2,..., if we can choose them in such a way that they 
satisfy the inequalities 



C(fc,m + 1) Ay, (2k-l-p)\ ( C(2k-l-p,m) \ 1/2 (2k)-^/ 2 
P - ^ ^ (k - Z)! 2 (Z - p)\p\ \(2k-l- pfk-i-v J 2 k ■ 2 lm 



for all k = 1,2, ... . We show with the help of Lemma 5 that it is possible to choose 
such constants C(k,m), k = 0, 1, . . . , m = 0, 1, . . . which satisfy both relation (5.8) and 
the inequalities sup C(k, m) < Ck < oo for all k = 0, 1, 2, ... , and beside this also 

0<m<oo 

the inequalities C(0, m) > 1, m = 1, 2, . . . , and C(/c, 0) > C fc , k = 0, 1, 2, . . . , hold for 
all k = 0, 1, . . . with a prescribed number C > 0. (The last two inequalities are needed 
to satisfy relation (5.1) in the degenerate case k = and relation (3.17) for the starting 
step m = 0.) The proof of the above statement completes the proof of Proposition 2. 

To find a sequence C(n,k) satisfying relation (5.8) first I show that a sequence 
C(n, k) satisfying Lemma 5 also satisfies the system of inequalities 

C{k, m + l) 2 > ( - {2k ~l~ P) l , ) 2 —, e \ oh , ^} Hl ~ P) C{2k -l-p,m) 

V' > ~ \{k-l)\ 2 {l-p)\p\J (2k - I - p) 2k ~ l -P 2 2k ■ 2 2lm v Fl ' 

for all < k < oo, and < m < oo and < p < I < k. 

(5.9) 

(Formula (5.9) is a weakened version of (5.8) where it is demanded that if the double 
sum at the right-hand side of (5.8) is replaced by the maximum of these terms, then 
this new expression should be smaller than the left-hand side.) 

Indeed, to show that a sequence satisfying Lemma 5 also satisfies (5.9) observe that 
by the estimate (^) < 2 n for all < m < n and the rather rough bound 2 2 '/! > l l 



W-l-p)\ <{2k _ l _ _y- P (2k - 2A 1 (I 
[k-l)\ 2 {l-p)\p\ ~ { K 1 P) \k-l )l\\p 



< {2k - i- p y-^ k - l ^2 l = 2 2l 22k{2k I p)l v , 
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and this estimation implies that inequality (5.9) follows from the relation 

1 2 4l (2k) 2k - l+ P(2k-l-p) 3l -P- 2k -, n1 , 
C ^ m + 1) > — ^ C(2/C " 1 - P > m) 

i.e. from inequality (3.18) imposed in Lemma 5. 

We claim that the numbers C(k,m) = AB k C(k 1 m) defined with the help of the 
above numbers C(k, m) and sufficiently large constants A > 1 and B > 1 satisfy formula 
(5.8) together with the additional properties we have imposed. Indeed, we get with such 
a choice that 

(2k-l-p)\ f C(2k-l-p,m) \ 1/2 (2fc)-('- p )/ 2 
^ OP ~o {k-l)\ 2 {l-p)\p\ \{2k - I - p)2k-i- P ) 2 k -2 lm 

<pEE [AB^-'-'c( k , m + 1) 2 ] 1/2 = C( *^+ 11 £ £ b- (,+ " /2 

Z=0 P =0 VAft !=0 P =0 

C(fc,m + 1) C(fc,m + 1) 




y/Ak k \ti ~ k k 



if we choose the number A = A(B) sufficiently large. Because of Lemma 5 the num- 
bers C(k, m) also satisfy the relation sup C(k, m) < < oo with an appropriate 

0<m<oo 

constant Ck- Beside this, if we choose B > C, then because of the relation C(k, 0) = 1 
formulated in Lemma 5 also the relation C(k, 0) > C k holds for all k = 0, 1, ... . The 
relation C(0, m) = AC(0, m) = A > 1 also holds. The proof of Proposition 2 with the 
help of Lemma 5 is finished. 

The proof of Lemma 5. Let us introduce the numbers 

A(l, p, k, m) = 2 2 « 4 -™> {2k)2k ~ l+P(2 ^~ 2 l l ~ p ^ l ~ P ~ 2k , 0<p<l<k, m>0. 

To construct a sequence C(k, m) which satisfies Lemma 5 it is enough to show that there 
exist such numbers D{m) >l,m = 0, 1,..., which satisfy the inequalities D{m) 2k > 

oo 

A(l,p, k, m) for all < p < I < k, m = 0, 1, . . . , and ] J D(m) < oo. Indeed, if there 

m=0 

(m— 1 
] [ D(p) 
P =o 

for m > 1, k > 0, satisfy Lemma 5, since the above defined numbers satisfy the inequality 

C(k,m + 1) 2 > C(2k-l-p,m)D(m) 2k > A(l,p, k, m)C(2k — l—p,m) 
for all < p < I < k. 
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To show that we can define numbers D{m) with the above required properties let 
us consider the extension of the sequence A(l,p, k, m) to the real valued function 

A ktm {u, v) = 2 ^- ^ 2k)2k+U ~ V {2 ^ V ~ V)3V ~ U ' 2 \ 0<u<v<k, 

and let us estimate its maximum. To get a simpler estimate let us bound the function 
A k ,m(u,v) in the following way. Observe that 



A k , m (u,v) < ( 2k * k u _ v ) V ~ % l A k , m (u,v) = 2^-™> {2k) 



2 k 



(2k-u-v) 2k - 2v (2v) 2v 

^ 2 2 ( 4— rn)v 



(2k) 2k 



{2k-2v) 2k - 2v (2v) 2v 
for < u < v < k, and this inequality implies that 

sup A kjm (u,v)< sup B kj7n (v) 

0<u<v<k 0<v<k 

with 

(2k) 2k 



B k , m (v) = 2 2 ^~ m > 



(2k-2v) 2k - 2v (2v) 2v ' 



Simple differentiation of the function log B k ^ m {y) shows that the function B kjm (v) takes 
its maximum in the point v = 2(m-4) _[_ i ' an< ^ B kjm (v) = (l + 2 4_m ) . 

The above calculations imply that the numbers D(m) = 1 + 2 4_m , m = 0, 1, 2, . . . , 

oo 

satisfy the inequality sup A{l,p,k,m) < D{m) 2k . The relation ] { D(m) < oo 

0<P<l<k rn=0 

also holds. Hence, as the argument at the beginning of the proof shows the numbers 
C(k,0) = 1, C(/c,m) = j [ D(p) ) for m > 1 satisfy the properties demanded in 

\p=0 

Lemma 5. 
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